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Abstract. 

In this paper we obtain a Bernstein type inequality for the sum of self-adjoint centered 
and geometrically absolutely regular random matrices with bounded largest eigenvalue. This 
inequality can be viewed as an extension to the matrix setting of the Bernstein-type inequality 
obtained by Merlevede et al. (2009) in the context of real-valued bounded random variables 
that are geometrically absolutely regular. The proofs rely on decoupling the Laplace transform 
of a sum on a Cantor-like set of random matrices. 

1 Introduction 

The analysis of the spectrum of large matrices has known significant development recently 
due to its important role in several domains. One of the questions is to study the fluctuations of 
a Hermitian matrix X from its expectation measured by its largest eigenvalue. Matrix concen¬ 
tration inequalities give probabilistic bounds for such fluctuations and provide effective methods 
for studying several models. The Laplace transform method, which is due to Bernstein in the 
scalar case, was generalized to the sum of independent Hermitian random matrices by Ahlswede 
and Winter in [2], The starting point is that the usual Chernoff bound when we deal with the 
partial sums associated to real-valued random variables has the following counterpart in the 
matrix setting: 

n 

P (A max ( X ^ x ) ^ ' ETr (e* ^ Xi )} ( 1 ) 

i =1 

(see 0). Here and all along the paper, (Xj)j>i is a family of d x d self-adjoint random matrices. 
The main problem is then to give a suitable bound for L n (t ) := ETr^e^fe 1 ^). In the inde¬ 
pendent case, starting from the Golden-Thompson inequality stating that if A and B are two 
self-adjoint matrices, 

Tr(e A+B ) < Tr (e A e B ) , 

Ahlswede and Winter have observed that 

ETr(e^= lXi ) < A max (E(e tx ")) • ETr(e tE fei 1Xi ) (2) 

and gave a bound for L n (t ) by iterating the procedure above. In [IT], TYopp used Lieb’s concavity 
theorem (see 0) to improve the bound on L n (t ) stated in [2] and obtained Lemma |4] of Section 
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ixn This lemma then allows to extend to the matrix setting the usual Bernstein inequality for 
the partial sum associated with independent real-valued random variables. 

Let us mention that an extension of the so-called Hoeffding-Azuma inequality for matrix 
martingales and of the so-called McDiarmid bounded difference inequality for matrix-valued 
functions of independent random variables is also stated in [Hi- 

Taking another direction, Mackey et al. m extended to the matrix setting Chatterjee’s 
technique for developing scalar concentration inequalities via Stein’s method of exchangeable 
pairs (see 0] and [5|), and established Bernstein and Hoeffding inequalities as well as concentra¬ 
tion inequalities. Following this approach, Paulin et al. HU established a so-called McDiarmid 
inequality for matrix-valued functions of dependent random variables under conditions on the 
associated Dobrushin interdependence matrix. 

The aim of this paper is to give an extension of the Bernstein deviation inequality when 
we consider the largest eigenvalue of the partial sums associated with self-adjoint, centered and 
absolutely regular random matrices with bounded largest eigenvalue. This kind of dependence 
cannot be compared to the dependence structure imposed in [10] or in HU. 

Note that for dependent random matrices, the first step given by (0]) of the iterative procedure 
in |2] fails as well as the concave trace function method used in m Therefore additional 
transformations on the Laplace transform have to be made. Even in the scalar dependent 
case, obtaining sharp Bernstein-type inequalities is a challenging problem and a dependence 
structure of the underlying process has obviously to be precise. For instance, Adamczak [1J 
proved a Bernstein-type inequality for the partial sum associated with bounded functions of a 
geometrically ergodic Harris recurrent Markov chain. He showed that even in this context where 
it is possible to go back to the independent setting by creating random iid cycles, a logarithmic 
extra factor (compared to the independent case) cannot be avoided (see Theorem 6 and Section 
3.2 in HI). 

In nu and HU , Merlevede et al. considered more general dependence structures than Harris 
recurrent Markov chains and proved Bernstein-type inequalities for the partial sums associated 
with bounded real-valued random variables whose strong mixing coefficients (or T-dependent 
coefficients) decrease geometrically or sub-geometrically. Note that in [12| , the case of real¬ 
valued random variables that are not necessarily bounded is also treated. The method used 
in both papers mentioned consists of partitioning the n random variables in blocks indexed by 
Cantor-type sets plus a remainder. The idea is then to control the log-Laplace transform of each 
partial sum on the Cantor-type sets. The log-Laplace transform of the total partial sum is then 
handled with the help of a general result which provides bounds for the log-Laplace transform of 
any sum of real-valued random variables (see our Lemma 0 in the context of random matrices). 
Obviously, the main step is to obtain a suitable upper bound of the log-Laplace transform of the 
partial sum on each of the Cantor-type set. The dependence structure assumed in HU or [12] 
allow the following control: for any index sets Q and Q' of natural numbers such that Q C [1, p\ 
and Q' C [n + p, oo) and any t > 0, 


E(i 
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*eQ ,Jfi ) < E(e t ^-' ie< 3 Xi )E(e t ^ i6< 5' Js:i ) + e(n)||e 4 
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( 3 ) 


where e(n) is a sequence of positive real numbers depending on the dependent coefficients. The 
binary tree structure of the Cantor-type sets allows to iterate the decorrelation procedure above- 
mentioned allowing then to suitably handle the log-Laplace transform of the partial sum on each 
of the Cantor-type set. 

In the random matrix setting, iterating a procedure as Q cannot lead to suitable exponential 
inequalities essentially due to the fact that the extension of the Golden-Thompson inequality to 
three or more Hermitian matrices fails, and then the extension of the exponential inequalities 
stated in m and HU to the matrix setting is not straight forward. To benefit of the ideas 
developed in [U or in HU. we shall rather bound the log-Laplace transform of the partial sum 
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indexed by a Cantor-type set, say K , by the log-Laplace transform of the sum of 2^ independent 
and self-adjoint random matrices plus a small error term (here i depends on the cardinality of 
K). Lemma [8] is in this direction and can be viewed as a decoupling lemma for the Laplace 
transform in the matrix setting. As we shall see, a well-adapted dependence structure allowing 
such a procedure is the absolute regularity structure. Indeed, the Berbee’s coupling lemma 
(see Lemma [6] below) allows a ’’good coupling” in terms of absolute regular coefficients (see the 
definition 0) even when the underlying random variables take values in a high dimensional 
space (working with d x d random matrices can be viewed as working with random vectors of 
dimension d 2 ). The decoupling lemma [8] associated with additional coupling arguments will then 
allow us to prove our key Proposition [7] giving a bound for the Laplace transform of the partial 
sum indexed by Cantor-type set of self-adjoint random matrices. As we shall see, our method 
allows to extend the scalar Bernstein type inequality given in nn to the matrix setting. 

Our paper is organized as follows. In Section [2], we introduce some notations and definitions 
and state our Bernstein-type inequality for the class of random matrices we consider (see The¬ 
orem d). Section [3] is devoted to some examples of matrix models where this Bernstein-type 
inequality applies. The proof of the main result is given in Section [7J 

2 Main Result 

For any d x d matrix X = [(X)j ; j]A =1 whose entries belong to IK = M or C, we associate its 
corresponding vector X in W 1 ' whose coordinates are the entries of X i.e. 

X = ((X)jj , 1 < i < • 

Therefore X = , 1 < i < d 2 ) where 

Xi = (X);.^.!)^ for (j - l)d + 1 < i < jd, 

and X will be called the vector associated with X. Reciprocally, given X = , 1 < l < d 2 ) in 

K cp we shall associate a d x d matrix X by setting 

x = [( X )m]”j=i where ( x )*j = x i+{j-i)d ■ 

The matrix X will be referred to as the matrix associated with X. 

In all the paper we consider a family (Xj)oi of d x d self-adjoint random matrices whose en¬ 
tries are defined on a probability space (fl, A, P) and with values in K, and that are geometrically 
absolutely regular in the following sense. Let 

A) = 1 and P k = sup /?(cr(X, ; , i < j), cr(X; , i A j + k)), for any k > 1, (4) 

3>l 


where 

P(A,B) = ^sup{ XX |P(Ai nBj)- P(Ai)P(Sj)|} , 

iei jeJ 

the maximum being taken over all finite partitions (Aj)j G / and (Rj)j G j of Q respectively with 
elements in A and B. 

The o ar e usually called the coefficients of absolute regularity of the sequence of vectors 

(Xj)i>i and we shall assume in this paper that they decrease geometrically in the sense that 
there exists c > 0 such that for any integer k A 1, 

Pk = sup P(a(Xi , i < j),<r(Xi , i A j + k)) < e _c(fe_1) , (5) 

m 
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Note that the /3k coefficients have been introduced by Kolmogorov and Rozanov [ 8 | and even if 
they are more restrictive than the so-called Rosenblatt strong mixing coefficients a & they can 
be computed in many situations. For instance, we refer to the work by Doob [ 6 ] for sufficient 
conditions on Markov chains to be geometrically absolutely regular or by Mokkadem [T3j for 
mild conditions ensuring vector ARMA processes to be also geometrically /3-mixing. 

In all the paper, we will assume that the underlying probability space (0, A, P) is rich enough 
to contain a sequence (e*)jez = °f hd random variables with uniform distribution over 

[0,1] 2 , independent of (Xj)i>o- In addition, the following notations will be used : logic := In x, 
log 2 x = , we write 0 for the zero matrix and for the d x d identity matrix, we use the curly 

inequalities to denote the semidefinite ordering i.e. 0 Y X means that X is positive semidefinite. 


Theorem 1 Let (Xj)j^i be a family of self-adjoint random matrices of size d. Assume that © 
holds and that there exists a positive constant M such that for any i > 1, 


E(Xi) = 0 and A max (Xj) < M almost surely. 


( 6 ) 


Then there exists a universal positive constant C such that for any x > 0 and any integer n ^ 2 , 

Cx 2 


^max (X Xj) > xj < dexp ( — 


i= 1 


v 2 n + c 1 M 2 + xM'y(c,n) J 


where 


and 


v 2 = 


sup 


K c{i,...,n} Card AT 


/e(X x 7 


ieK 


7 (c, n) 


log n 
log 2 


max 



321ogn\ 
clog 2 / 


(7) 

( 8 ) 


In the definition of v 2 above, the maximum is taken over all nonempty subsets I\ C {1,..., n}. 

To prove the deviation inequality stated in Theorem [I] we shall use the matrix Chernoff 
bound ©• The theorem will then follow from the following control of the matrix log-Laplace 
transform that is proved in Section [4.31 Under the conditions of Theorem [1] for any positive t 
such that tM < 1 / 7 ( 0 , n), we have 


n 

log ETr ^ exp (f / ( X/ j ^ ^7 log d 

1 1 


t 2 n(l5v + 2 M / (cn) 1 / 2 ) 2 
1 — tM^(c, n) 


As proved in Section 4.2.4 of m, this inequality together with Jensen’s inequality leads to 
the following upper bound for the expectation of the largest eigenvalue of ^C / =1 X 7 Under the 
conditions of Theorem [T1 


n 

EA max ^ ^ Xi) < 30 uy/n logd + 4Mc~ 1 ^ 2 y^log d + Mj(c, n ) log d. 
1=1 


3 Applications 

Let (jkjk be a stationary sequence of real-valued random variables such that HtiHoq < 1 a.s. 
Consider a family (Y*.)*. of independent real and symmetric d x d random matrices which is 
independent of (rk)k- For any i = 1,..., n, let Xj = and note that in this case 

Pk = P{cr(n , i < 0), a{ri , i > k)). 
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Corollary 2 Assume that there exists a positive constant c such that /?& < e c ^ k ^ for any 
k>l and suppose that each random matrix Yj. satisfies 

EYfc = 0 , A max (Yfc) < M and A m ; n (Yfc) > —M almost surely. 

Then for any t > 0 and any integer n >2, 

( n \ ( Ct 2 \ 

P ( A "“(g T ‘ ¥ ‘) -*■)- “ eXP ( “ nM^) + M^ + t M(logn)0 ’ 

where C is a positive constant depending only on c. 

Proof. The above corollary follows by noting that for any K C {1,..., n} 

:= E( E nVk ) 2 = E E ( T fc) E ( Y i) = E ( t o) E E ( Y i)’ 

keK k£K k£K 

which, by Weyl’s inequality, implies that A max (S/ 1 ') < M 2 Card(Ii )E(tq ). Therefore, we infer 
that v 2 < M 2 E(tq). □ 


We consider now another model for which Theorem |Tj can be applied. Let (W)fcez be a 
geometrically absolutely regular sequence of real-valued centered random variables. That is, 
there exists a positive constant Co such that for any k > 1, 

sup/3(cr(Xj, i < £), a(Xi , i > k + £)) < e -c o( fc_1 ) . (9) 

t&z 

For any % = 1,..., n, let Xj be the d x d random matrix defined by X* = C,;CP — E(CjCj’) where 
C i = ■ ■ • ,Xid) T . Note that in this case, 

(3k = sup/3(cr(Cj, i < £),a(Ci , i > £ + k)) < e ~ c o rf (fc-i) _ 


for any k > 1. 

Corollary 3 Assume that (Xk)k satisfies ([9]). Suppose in addition that there exists a positive 
constant M satisfying sup fc HX^Hoq < M a.s. Then, for any x > 0 and any integer n > 2 

/ ”• s , (J x 2 s 

P ( A max (V Xj) > x ) < d exp (-----—- a —- ) , 

v / \ ndM 4 + dM 4 + xM 2 (d log n + log 2 n)' 

where C is a positive constant depending only on cq . 

Proof. For any i € {1,..., n}, note that A max (Xj) < A max (C ? ;Cf) implying that A max (X, ; ) < dM 2 
a.s. To get the desired result, it remains to control v 2 . We have for any K C {1,... , N }, 

S i ,:=E(EXi) 2 = E Cov(CjCf, CjCj) 

i£K i,j&K 


and we note that the ( k,i) th component of Sj- is 


(^A')y = 


e (E x <) 2 

i£K 


k,e 


d 

^ ^ " Cov(X( i _ 1 ) £ j+fc X(j_l)ci+s j X(j- \)d-\r.s — ■ 

i,j£K s=l 


Therefore we infer by Gerschgorin’s theorem that 


|A max (XA') | 

d d d 

^ sup E \{fP K )k,i\ ^ sup E EEI Cov (y — l)d+fc l)d+s j X(j—l)d+s X(j—i)d+i) | ' 

k 1= 1 k i,jeK 1= 1 s=l 

After tedious computations involving Ibragimov’s covariance inequality (see 0 ), we infer that 
v 2 < cidM 4 where c\ is a positive constant depending only on cq. Applying Theorem |T| with 
these upper bounds ends the proof. □ 
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4 Proof of Theorem |1] 


The proof of Theorem [T] being very technical, it is divided into several steps. In Section 14.11 
we first collect some technical preliminary lemmas that will be necessary all along the proof. 
In Section 14.21 we give the main ingredient to prove our Bernstein-type inequality, namely: a 
bound for the Laplace transform of the partial sum, indexed by a suitable Cantor-type set, of 
the self-adjoint random matrices under consideration (see Proposition [7] and Section [T2T] for the 
construction of this suitable Cantor-set). As quoted in the introduction, this key result is based 
on a decoupling lemma which is stated in Section 14.2.21 The proof of Theorem |T] is completed 
in Section fOl 

4.1 Preliminary materials 

The following lemma is due to Tropp HZ], Under the form stated below, it is a combination of 
his Lemmas 3.4 and 6.7 together with the proof of his Corollary 3.7. 

Lemma 4 Let K be a finite subset of positive integers. Consider a family (U k)keK of d x d 
self-adjoint random matrices that are mutually independent. Assume that for any k E K , 

E(U fc ) = 0 and A max (U fc ) < B a.s. 

where B is a positive constant. Then for any t > 0, 

ETr (e^^^ Ufc ) < dexp (t 2 g(tB) A max ( E(U|))) , (10) 

k&K 


where g(x) = x 2 (e x — x — 1 ). 


The next lemma is an adaptation of Lemma 3 in [I2J to the case of the log-Laplace transform 
of any sum of d x d self-adjoint random matrices. 


Lemma 5 Let Uo,Ui,... be a sequence of d x d self-adjoint random matrices, 
there exists positive constants 07,07,... and ko,ki,... such that, for any i > 0 
[0, 1/k*[, 

logETr(e iUi ) <C d + (ait ) 2 /(I - nfi ), 


Assume that 
and any t in 


where C d is a positive constant depending only on d. Then, for any positive n and any t in 
[0, 1/(kq + /ci + • • • + n n )[, 


logETr(e*^£=o Ufc ) <C d + (at) 2 /(l - nt), 


where a = a® + a\ + • • • + a n and n = kq + k± + • • • + n n . 

Proof. Lemma [5] follows from the case n = 1 by induction on n. For any t > 0, let 

L(t) = logETr(e t(Uo+Ul) ) 


and notice that by the Golden-Thompson inequality, 

L(t)< logETr(e tUo e tUl ) . (11) 


Define the functions 7 j by 

7 i(t) = (°'j i ) 2 /(l - K fi) f° r t E [ 0 , 1 /k i[ and 7 fit) = +00 for t > I/7*, 
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and recall the non-commutative Holder inequality (see for instance exercise 1.3.9 in |16]): if A 
and B are d X d self-adjoint random matrices then, for any 1 < p, q < oo with p _1 4- q = 1, 

\Tv(AB)\ < P|| 5p ||H|| 59 , (12) 

where ||A||,sp = ||(Aj(A))^ =1 ||^p = ^Sf=i |Aj(A)| p ^ (resp. ||i?||s<j) is the p-Schatten norm of 
A (resp the q-Schatten norm of B). 

Starting from m and applying (fl2l) with A = e tu ° and B = e tu °, we derive that for any 
t > 0 and any p €] 1, oo [ 

L(t) <logE( ||e tUo || 5 p||e* Ul || 59 ) , 
which gives by applying Holder’s inequality 

L(t ) < p~ l log E||e f u ° ||^p + g” 1 log E||e tUl ||^ 9 . 

Observe now that since Uo is self-adjoint 

d d 

||e* Uo ||£ p = |Aj(e tu °)| p = J2 Ai(e* pUo ) = Tr(V pUo ) , 

i =1 i =1 

and similarly ||e* Ul ||^q = Tr(e tl?Ul ). So, overall, 

L(t) <p~ l logETr(e* pU °) + q" 1 log ETr (e t<?Ul ) . (13) 

For any t in [0, l/re[, take ut = (cro/cr)(l — ref) + kq t (here re = kq + «i and a = <7 0 + ci). With 
this choice 1 — ut = (<7i/cr)(l — ret) + n\t, so that Ut belongs to ]0,1[. Applying Inequality (fT3l) 
with p = 1/ut, we get that for any t in [0, l/re[, 

L{t) < u t j 0 (t/ut) + (1 - rt t ) 7 i(f/(l - u t )) = (cnt) 2 /(1 - ret), 

which completes the proof of Lemma 0 □ 

Next lemma allows coupling and is due to Berbee [3]. 

Lemma 6 Let X and Y be two random variables defined on a probability space (fi, .A, P) and 
taking their values in Borel spaces B\ and B 2 respectively. Assume that (H,A., P) is rich enough 
to contain a random variable 5 with uniform distribution over [0,1] independent of (X, Y). 
Then there exists a random variable Y* = f(X, Y. 5) where f is a measurable function from 
B\ x B 2 x [0,1] into B 2 such that Y* is independent of X, has the same distribution as Y and 

F(Y^Y*) = fi(a(X),a(Y)). 

Let us note that the /3-mixing coefficient /3(a(X),a(Y)) has the following equivalent definition: 

/3 (a(X),a(Y)) = ±|| P x ,y - P x <g> Py|| , (14) 

where Px,y is the joint distribution of ( X , Y) and Px and Py are respectively the distributions 
of X and Y and, for two positive measures p and v, the notation \\p — v || denotes the total 
variation of p — u. 
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4.2 A key result 

The next proposition is the main ingredient to prove Theorem [T] It is based on a suitable 
construction of a subset K A of {1,... , A} for which it is possible to give a good upper bound for 
the Laplace transform of Yli^K A P ro °f is based on the decoupling Lemma [5] below that 

allows to compare ETr (e* E * sx a x “) with the same quantity but replacing Y2ieK A with a sum 
of independent blocks. 

Proposition 7 Let (Xj)j^i be as in Theorem\l\ Let A be a positive integer larger than 2. Then 
there exists a subset Ka of {1,, A} with Card(A'yi) ^ A/2, such that for any positive t such 
that tM < min (±, , 

logETr(V Ei6X A Xi ) < logd + 4 x 3.1 t 2 Av 2 + 9 (^ f ) e -3c/(32tM) , (15) 

where v 2 is defined in 0- 

The proof of this proposition is divided into several steps. 


4.2.1 Construction of a Cantor-like subset Ka 

As in m and [T2J, the set Ka will be a finite union of 2^ disjoint sets of consecutive integers 
with same cardinality spaced according to a recursive ‘Cantor”-like construction. Let 

S = 2 ^; and £ := i A = sup {k € N* : AS ^ ' > 2} . 

Note that £ < log A/ log 2 and 5 < 1/2 (since A ^ 2). Let no = A and for any j € { 1 ,... ,£}, 


n. 


= \ 


A(l-8)j- 

2i 


and dj-\ = n 7 _i — 2n 7 


(16) 


For any nonnegative x, the notation \x~\ means the smallest integer which is larger or equal to 
x. Note that for any j € {0, ...,£ — 1}, 


dj ^ 


A5( i - sy _ as( i - sy 
2 ^ 


2-? " 2i +l 

where the last inequality comes from the definition of £. Moreover, 

A(l-8Y A{l-SY 

nt < K ’ + 1 < 


2 e 


2 i-i 


(17) 


(18) 


where the last inequality comes from the fact that 


as (i - sy - 1 
2f 


1 — 5 


^ 2 by the definition 


of £ and the fact that 5 < 1/2 

To construct K a we proceed as follows. At the first step, we divide the set { 1 ,... , A} into 
three disjoint subsets of consecutive integers: Iij, Iq 1 and Ji )2 . These subsets are such that 
Card(/i i i) = Card(/i, 2 ) = ni and Card(/o ,) = do- At the second step, each of the sets of 
integers fqj, i = 1,2, is divided into three disjoint subsets of consecutive integers as follows: for 
any i = 1,2, I\ A = where Card(/ 2 , 2 i-i) = Card(/ 2 , 2 i) = n 2 and Card(/^) = d\. 

Iterating this procedure we have constructed after j steps (1 < j < £a), 2 j sets of consecutive 
integers, Ij i7 , i = 1,.. . , 2 J ", each of cardinality rij such that aj^k ~~ ^j, 2 k-\ — 1 = dj-i for any 
k = 1,..., 2 J 1 , where a^i = minjfc € / 7 , ? ;} and bj A = maxjfc € Moreover if, for any 

i = l,..., 2' 7-1 , we set I*_ lt = {b jt2 i -1 + 1, ■ • •, a j>2i - 1}, then = Ij, 2 i-i U I*_ Xi U I jt2i . 













After £ steps we then have constructed 2 l sets of consecutive integers, Ip^, i = 1,... , 2 , each 
of cardinality np such that Ip^i- 1 and Ip, 2 i are spaced by integers. The set of consecutive 

integers Ka is then defined by 

2 l 

K a = J It k . 

k =1 


Note that 


Therefore 


v4} = K a U (U^~q L)f =1 



l -1 2 J 


^-1 


Card({l,..., A} \ K A ) = 

J=0 i=l i=o 


A — 2 £ np. 


But 


t -1 

A - 2*n* < A(l - (1 - 5) e ) = A5^(l - S) 3 < A5£ < 

j =o 


A 
2 ' 


(19) 


Therefore A ^ Card(iGi) ^ A/2. 

In the rest of the proof, the following notation will be also useful: for any k £ {0,1,... ,£} 
and any j £ {1,..., 2 k }, let 


j2 e- k 

Kk.j '■= K A ,k,j = U h,i (20) 

i=(j-l)2 l ~ k +l 

Therefore Kq_\ = K A and, for any j £ {1,..., 2 1 }, Kpj = Ipj. Moreover, for any k € {1,..., £} 
and any j £ {1,..., 2 fc 1 }, there are exactly c4_i integers between K^^j-i and K^^j- 

4.2.2 A fundamental decoupling lemma 

We start by introducing some notations, then we state the decoupling Lemma [8] below that is 
fundamental to prove Proposition [TJ Let Ka be defined as in Step 1. In the rest of the proof, 
we will adopt the following notation. For any integer m € {0,... ,£}, (Yj m ^)i<j< 2 rn will denote 
a family of 2 m mutually independent random vectors defined on (12,M, P), each of dimension 
Sd,e,m ■= d 2 Cavd(K m j) = d 2 2 l ~ m np and such that 

Vj m) =®(X i5 z£ K md ). (21) 

The existence of such a family is ensured by the Skorohod lemma (see HS|)- Indeed since 
(12, A, P) is assumed to be large enough to contain a sequence (dj)iez of iid random variables 
uniformly distributed on [0,1] and independent of the sequence (Xj)j^ q, there exist measurable 
functions fj such that the vectors V^”" 1 = /^((X,;,? £ K m i ; )f ;= i t ___j,8j'j, j = l,...,2 m , are 
independent and satisfy (12T1) . 

Let 7r| m be the i-th canonical projection from onto IC^ 2 , namely: for any vector 

x = (xj ,i £ K m j) of K Sd '*> m , 7r- m \x) = Xj. For any i £ K m j, let 

X ; m) (i) = 7r i (m) (vj m) ) and §J m) = £ xj m) (i), (22) 

where xj m ^(z) is the d x d random matrix associated with (recall that this means that 

the [k ,£)~th entry of Xj rn \i ) is the ((£ — 1 )d + L)-th coordinate of the vector X^ (i)). 
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With the above notations, we have 


ETr(e* Eieif A Xi ) = ETr(e tS i"') . 


(o), 


(23) 


We are now in position to state the following lemma which will be a key step in the proof of 
Proposition |7| and allows decoupling when we deal with the Laplace transform of a sum of self 
adjoint random matrices. 


Lemma 8 Assume that 


holds. Then for any t > 0 and any k € {0,..., l — 1}, 


ETr< ETr 


e t££Ts? +1) 


1 + Pd k + lh 


tMnn 2 l ~ k 


where (S^)y =lj 2 k the family of mutually independent random matrices defined in 
Proof. Note that for any k € {0, ... ,£ — 1 } and any j € { 1 ,..., 2 fc }, 

Kk,j = Kk+i,2j-i U Kk+i,2j 
where the union is disjoint. Therefore 

sf = S$5 + S$ and v f' 1 = (vfm vJ5), 

where S<‘> := S$ := E«=*, + ,, M xf(i), V»> := (xf (i),i 6 % W -i) 

and := (X^(i), i € iPfe+i,2j)- Note that there are exactly dk integers between 14+1,2.7-1 
and 14+1,2.7 and that for any i € {1, ..., 2 fe+1 }, 


Card(X fc+M ) = Card(14+i,i) = 2 £ ~^ 


n e . 


Recall that the probability space is assumed to be large enough to contain a sequence 
(5i,rji)i e z of iid random variables uniformly distributed on [0, l] 2 independent of the sequence 
(Xj)j^o- Therefore according to the remark on the existence of the family (Vj)i<j< 2 m made at 
the beginning of Section 14.2.21 the sequence ('/■/*)iez is independent of (V d "' 1 ) 2 <j < 2 m . According 
to Lemma[6]there exists a random vector V^ 2 of size d 2 Card(Afc+1,2) with the same law as V^ 2 

that is measurable with respect to <7(771) V cr(V^) V cr(V^ 2 ), independent of <r(V^) and such 
that 


P(V$ + VW) = ^(VfV(Vg)) < A4+1, 

where the inequality comes from the fact that, by relation (1141) . the quantity /3(cr(V^), cr(V^ 2 )) 
depends only on the joint distribution of (vj /d , V^ 2 ) and therefore, by ED, 


r(k)\ _ 




T {k) , 


/%(vg>),a(vg)) = ^(a(X, ,i G 14+i,i), ^4 K k+lj2 )) < /3 dfe+1 

by construct] 

For any i € K k+ 1 2 , let 


Note that by construction, V^ 2 is independent of <j(V^, (V^) J - =22 fc). 


Xg(z)=4'' +1, (vg and S[ k >= £ X$(*), 

iGA'fc+1,2 

where X^ 2 (i) is the d x d random matrix associated with the random vector X^ fc 2 (i). 
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With the notations above, we have 


ETrexp ( 1 V ■'j ~ E ( 1 v^=v' fc 2 )Trexp 




+ E 


3 = 1 


J=1 


1 vf)^v« Trex P ^E S ' 


(fc) 


3 = 1 


< ETrexp (tsg + iS$ + Sf } ) +E(lexp . (24) 


j( fc ) 


h fc ) 


J=2 


1=1 


(With usual convention, is the null vector if £ > 2 k ). By Golden-Thompson inequality, 

we have 

2 k 


TV exp yt ^2 — Tr ye t§1 ■ e t 

3 = 1 


2* o(fc) 
=2^ 


Hence, since <r(V^ , j = 2,, 2 fc ) is independent of cr(V, V’jTj, V 1 ,^), we get 

2 fc 

E(l~( fc )^ v ( fc )TVexp (t ^§f)) < Tr(E(l~( fc )^ v ( fc )e' s i fc) ) • Eexp (t^sf)) . 




3 = 1 


1=2 


Note now the following fact: if U is a d x d self-adjoint random matrix with entries defined on 
(f2,^4, P) and such that A max (U) < b a.s., then for any T £ A, 

lre u e 6 I d lr a.s. and so A max E(lre u ) < e fc P(r). 

Therefore if we consider V a d x d self-adjoint random matrix with entries defined on (fl,Al, P), 
the following inequality is valid: 


Tr(E(l r e u )E(e v )) < e 6 P(r) • ETr(e v ). 


(25) 


Notice now that (X.[ k \i) ,i £ K/^i) has the same distribution as (X, , i £ Kk,i)- Therefore 
A m ax(X^(i)) < M a.s. for any i, implying by Weyl’s inequality that 

A may ltS^) < tMCard(iTfc ; i) = tM2 £ ~ k ne a.s. 

Hence, applying (E5D with b = tM2 e ~ k n^, T = {vfij 7- V^} and V = t Ylj =2 and taking 
into account that P(T) < fid k + i , we obtain 


E 


A 

1 v(^ v f) Trexp (*5Z S ?°)) - Afc+1® 


l tn e 2 e ~ k M 


3 = 1 


ETV’ exp ^ §)-‘ 
1=2 


(fc) 


(26) 


Note now that if V and W are two independent random matrices with entries defined on (fl, A, P) 
and such that E(W) = 0 then 

ETrexp(V) = ETrexp (E(V + W|<r(V))) . 

Since Troexp is convex, it follows from Jensen’s inequality applied to the conditional expectation 
that 

ETr exp(V) < E(E(TV e v+w |cr(V))) = E(Tre v+w ) . (27) 

Since E(X^(i)) = E(Xj) = 0 for any i £ Kk,i and S^) is independent of u(S^,j = 

2, ...,2 k ), we can apply the inequality above with W = + S^) and V = 

Therefore, starting from (l26l) and using (1271) . we get 


El 


3 = 1 


TV exp (t ^ Sf )) < p dk+1 e tn * 2 " M ETV exp (t (sg + S$ + ]T S 


(fc) 

1 


3 =2 


(28) 
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Starting from (1M1) and considering (128]) . it follows that 


ETr exp < (1 + P dk+ ie tn ^~ kM ) ETVexp (t( S$ + S$ + E S ?°)) • ( 29 ) 

i=i j=2 

The proof of Lemma [8] will then be achieved after having iterated this procedure 2 k — 1 times 
more. For the sake of clarity, let us explain the way to go from the j-th step to the (j + l)-th 
step. 

At the end of the j-th step, assume that we have constructed with the help of the coupling 
Lemma El j random vectors V, i = 1, ■ ■ ■, j, each of dimension d 2 Card(AVi-i,i) and satisfying 
the following properties: for any i in {1,... , j}, is a measurable function of (V-^, , Vi)- 

it has the same distribution as V- j, is such that P(V ) 2 / Vj 2 ) < /3d k +i, is independent of 
and it satishes 

ETr exp (t E } ) < (l + /3 dk+1 e tn ‘ 2 *~ kM ) j • ETr exp (t + §g>) + t ^ §f } ) , (30) 

3=1 i=l *=3+1 

where we have implemented the following notation: 

§g = E xgw- (3i) 

reK k+ i i2i 


In the notation above, (r) is the d x d random matrix associated with the random vector 
X,. 1 ^ (r) of K d2 defined by 


x ^2 ( r ) = 4 fc+1) ( v i fc 2 ) for ally r € iffe+i,2i • 

Note that the induction assumption above has been proven at the beginning of the proof for 
j = 1. Moreover, note that since, for any m G {1,..., £}, )i<j< 2 ™ is a family of independent 

random vectors, the random vectors V) 9 , i = 1,..., j, defined above are also such that, for any 

i € {1,... , j}, V$ is independent of ^((V^E=i,---,*’ (V^2 )^=i,-,*-i> ( V f ) )^=i+i,...,2'=)• 

Now to show that the induction hypothesis also holds at step j + 1, we proceed as follows. 
By LemmaEl there exists a random vector 2 of size d 2 Card(iLfc-|_i,i) with the same law as 

2 , measurable with respect to <7(77,7+1) V <t(vE V <r(vE 2 ), independent of <7(v(+-l x ) 
and such that 


nvf+L ^ v« i2 ) < & 1+1 


(32) 


(The inequality above comes again from (1211) and the equivalent definition (1141) of the /3- 
coefficients). Note that by construction, o‘((v(^)j = i i ...j + i, (vgVw.^’w+w) and 
a(vf; } i 2 ) are independent. With the notation (13TT) . we have the following decomposition: 


j 2 k j +1 2 k 

ETrexp (t j + §<j) +1 V sf j < ETr exp (t £(S< j + S< j) + t V S< 


(fc) 


2=1 


*=3+1 


i=l 

3 


*=3+2 


+ E( l^(fc) 


3+1,2+ v j + l,2 


Trexp (t E( S li + S l,2 ) + * E 

*=3+1 


(fc) 


2=1 


(33) 
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Using Golden-Thompson inequality, we have 


Trexp [t ^(§[1 + +t ^ E> z 

i=j +1 


(*) 


%— 1 


< 


Tr y exp (tSfh) • exp (*^(§-,1 + S g') + 4 £ 

i=j+2 


(*) 


i=l 


Hence, since the sigma algebra generated by ((V.-j )i=i....j, (v[(vj^)i=:/+2,...,2 fc ) 
independent of that generated by (V^ 1, 2 , v(+, 2)1 we S e ^ 


is 


ETr (V ( E <=i $!? +S$)+E?= i+1 S< fc) ) 


V J+l,2? V j+ 1 ,2 


< Tr Ee ; 




) • E(l 


e J + 1 l-y-(fc) , v (fc) 

v j+l,2? tv j+l,2 


))• (34) 


By Weyl’s inequality, 


Using that V^, = v (X,; , i € Kk,j+i ) and that A max (Xj) < M a.s. for any i. it follows that 

Amax(*Sj-+i) < tMCard(K kjj+ i) = tM2 l ~ k n k a.s. 

In addition, we notice that 1 + 2 is independent of Ei=i(^fi + ^ 2 ) + Yli=j+ 2 ^^ 

and, since V^ 2 = v V^ 12 an d v *+i = D ( x * ¥ € iFfcj+i), E(S^ 1;1 + ^+ 1 , 2 ) = 0 Therefore, 
starting from (IMP and taking into account (1321) . an application of inequality (051) with b = 
tM2*- k n h T = {V<$ 1>2 ± Vf + } li2 } and ¥ = t(ZL ^ + Ig) + Allowed by an 

application of inequality ( 1271 ) with W = t(S^ l 1 + 2 ), gives 


Amax(iSj+i) < * ^ A max (x(^ 1 (r)) a.s. 

r£K kJ+1 


ETr (V ( E '=i $!? + s ft ) )+^+i ) 


/T)‘(fc) 


< f 3 d k +ie tne2e kM x ETi^e t ( E ^i (S ?i +S ^ 2))+E i+ 2S ^ ) )) . (35) 
Therefore, starting from (1331) and using ([35]) . we get 

ETr (V (E U K1 +*$ )+E?=j+i sl fc) )) 

< (1 + Pd k +i^ tne2e ~ kM ) X ETr^e t ( E (=i ( ^ fe i +§ i fc2)+E ^+ 2 ^ 
which combined with (1301) implies that 


ETr(e* E ?= lS r) < (l + /3 dfc+ ie fal ^ * M ) j+1 X ETr^^i^ 1 ^^ 

proving the induction hypothesis for the step j + 1 . Finally steps of the procedure lead to 

° k ¥ fc )\ _/ +w2 fc «¥) , 5( fc )-, 


ETr(e tE *= lS * ) < (l + A44-ie to ^ feM ) 2fc x ETr(e tE *= l(Si . 1+Si . 2) ) . 


(36) 


To end the proof of the lemma it suffices to notice the following facts: the random vectors 


Vf?,vi 5 , i = 1,..., 2 fe , are mutually independent and such that V)^ = u V 2 Ei ; an d V) 2 ; = 


( fc ) ir( fc +l) 


A k ) _X> 
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vj +l) . In addition, the random vectors v| fc+1 \ i = l,...,2 fe+1 , are mutually independent. 
This obviously implies that 


ETr 


J- Si=l( 




y i, 1 ' 


+S/7A _ 


= ETr e 




which ends the proof of the lemma. □ 


4.2.3 Proof of Proposition [7| 

We shall prove Inequality (fT5l) with Ka defined in Section 14.2.11 

Let us prove it first in the case where 0 < tM < 4/A Since by Weyl’s inequality, 

Amax( x 0 ^ J2 A max (Xi) < AM, 

ieK A i£K A 

and E(Xj) = 0 for any i € Ka, it follows by using Lemma [4] applied with K = {1} and 
Ui = E ieK A Xj that, for any t > 0, 

ETr(e* E ^ A ^) < dexp (t 2 g(tAM) A max (E( ^ X*) 2 )) . 

i£K A 

Therefore by the definition of v 2 , since g is increasing, tAM < 4 and g(A) < 3.1, we get 

ETr(e <Eieif -4 Xl ) < dexp(3.1 x At 2 v 2 ), 

proving then (1151) . 

We prove now Inequality (fT5l) in the case where 4 /A < tM < min Q, 32 ^ a ) • Let 

« = ^ an d k(t) = inf j keZ: A(( 1 - §)/2) k < min ( 37 ) 

Note that if f 2 M 2 < k/A then k(t) = 0 whereas k(t ) > 1 if f 2 M 2 > n/A. In addition by the 
selection of Ia, A(( 1 — 6)/2) e < 4/<5. Therefore k(t ) < Ia since (tM) 2 < c5/ 32. Then, starting 
from (1231) . considering the selection of k(t ) and using Lemma 0 we get by induction that 

_ *:(*)-! 2 k 2 fc (‘) 

ETr exp (t ]T X,) < J] (l + f5 dk+1 e tMn ^~ k ^j ETr exp (t ^ S< fc(t)) ) , (38) 

i&K A k =0 j =1 

with the usual convention that ]/[/7=o a k = 1- Note that in the inequality above, 2fc(t) 

is a family of mutually independent random matrices defined in (1221) . They are then constructed 
from a family ) 1< j <2 k(t) of 2 fc d) mutually independent random vectors that satisfy (1211) . 

Therefore we have that, for any j € 1,..., 2 fc d) ; = r> X j. Moreover, according 

to the remark on the existence of the family (vj fc ^^) 1<J<2 fe( t ) made at the beginning of Section 

14.2.21 the entries of each random matrix §^*1) are measurable functions of (Xj,<5j)j g ^. 

( 2 fe ( 4 ) (k(t))\ 

t J2j =1 ) • 

With this aim, let p be a positive integer to be chosen later such that 

2 P < Card(A" fc(f)j ) := q . (39) 

Note that q = 2 an d by (f]~9l) 

X 

9 ~ 2 fc ( t )+ 1 ’ 
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Therefore if k(t) = 0 then q > A/2 implying that q > 2 (since we have 4 /A < tM < 1). Now 
if A;(t) > 1 and therefore if t 2 M 2 > k/A, by the definition of A;(t), we have q > and then 

q > 2 since (tM) 2 < k/2. Hence in all cases, q > 2 implying that the selection of a positive 
integer p satisfying (1391) is always possible. 

Let m qtP = [q/(2p)\. For any j € { 1 , ..., 2 fe W}, we divide Kk(t),j i n t° 2 m q:P consecutive 
intervals (J^ f ^ , 1 < i < 2 m qtP ) each containing p consecutive integers plus a remainder interval 
Jj 2 m qp +i containing r = q — 2prn q;[) consecutive integers. Note that this last interval contains 
at most 2p — 1 integers. Let (k) be the d x d random matrix associated with the random 

vector defined in ([52)1 and define 


r^(k(t)) 

Jl l 


X 


lf (t) \k). 


k&K Kt)ij nj) k x (t)) 


With this notation 


g(fc(t)) 


o + l 


X 


+ X z 5‘ 


(*(*)) 
2 i 


2=1 2=1 

Since Tr o exp is a convex function, we get 


(40) 


2 fc W 


ETrexp (t £ §f (4)) ) < ^ETrexp (2 1 £ £ Zg™) +^ETrexp (22 £ £ 
J=1 j=l i=l 


2 k ( t ) m qp +1 


1 . 


2 fc (*) iri q 


(41) 


We start by giving an upper bound for ETrexp ^2tY^=\ 
define the following vectors 

r(k(t)) _ / Y (^W)/ 


jm} 

j =l i=l 

. With this aim, let us 


uj?" = (xf Wj (fe), k € n jjf )} ) and Wf= (U^ Wj , t G {1,..., 2m ? , p + 1}) . 

(42) 

Proceeding by induction and using the coupling lemma El one can construct random vectors 
j = 1, ..., 2 k ^\ i = 1 ,..., m 5iP , that satisfy the following properties: 

(i) (U* ( 2 f», (j, z) G {1,..., 2 fc h)} x {1,... , m ?iP }) is a family of mutually independent random 
vectors, 

(ii) U*2^ has the same distribution as U^ 4 ^, 

(hi) P(U*g (4)) + ug 4)) ) < /3 p+ i . 

Let us explain the construction. Recall first that (fi, *4, P) is assumed to be rich enough to contain 
a sequence (r/i)«gz of iid random variables with uniform distribution over [0,1] independent of 
(Xj,5j)j e z (the sequence (5«)igz has been used to construct the independent random matrices 

g(fc(t)) 


(*(*)) 


Am) 


u 


m) 

j, 2 


, j = 1, ...,fc(i), involved in inequality (1551) 1. For any j € {l,...,2 fc W}, let U*^ 4 ^ 


, and construct the random vectors , * = 2,..., m 9iP , recursively from (U*^ 4 ^, 1 < 


£ < i — 1) as follows. According to Lemma El there exists a random vector such that 

T *(fe(t)R. TT (fc(t)) .,.1 (43) 


jj*(fc(t)) _ n TtI+W) \ 


where /,;j is a measurable function, U* 2 i has the same law as U- 2 4 , is independent of 
1 < £ < i — l) and 

P(u;g (4)) + ug 4)) ) = p(a( U*?f \ 1 < £ < i - 1),a(u£f)) < /3 p+ i • 
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By construction, for any fixed j G {1,..., 2 fc W}, the random vectors U*^/ 4 ^, i = 1,, m qiP , 
are mutually independent. In addition, by (14311 and the fact that (W= 1,..., 2 fc 4) is a 
family of mutually independent random vectors, we note that ('i, j) G {1,... ,m qjP } x 

{l,...,2 fc ^}) is also so. Therefore the constructed random vectors U* 2 )( 4 ^ * = 1,..., ra 9iJ) , 
j = 1,..., 2 fc ^\ satisfy Items (i) and (ii) above. Moreover, by (1431) . we have 

a(U*J (t)) , i < £ < i - 1) C <j{ 1 < 4 < i - l) V <r(^ +(j -_ 1)2fc( t), 1 < 4 < * - l) . 

Since (r/j)j e z is independent of (Xj,(5i)j e z, we have 

0(a(U*<* (t)) , 1 < ^ < * - l),a(ug? } )) < ^(ugg”, 1 < 4 < i - l),a(uj. fc 2 f)) . 

By relation (]TT|) . the quantity /3(o-(U^^, 1 < 4 < * — l), u(Tj( fc 2 4 ^)) depends only on the joint 
distribution of ((U^j^)i<g<j_i,u( fc 2 4 ^). By the definition (l42ll of the TJ^^’s, the definition 
(|22l) of the x( fe ^(A;)’s and (f2p) . we infer that 


= p(a(X k , fc G uj"i K km n jf 2 f), cr (Xfc , k G n jg? )} )) < £ p+ i. 

So, overall, the constructed random vectors U*^ 4 ^ i = 1,..., m q , p . j = 1,..., 2*4), satisfy also 
Item (iii) above. 

Denote now 

x* ( i (t)) w=^(u*s (4)) ) 

where 7T.[”^ is the 4-th canonical projection from K4* 2 onto K'* 2 , namely: for any vector x = 
(xj ,i G {1,... ,p}) of K^ 2 , n e (x.) = x e . Let X* ( 2 / 4)) (4) be the d x d random matrix associated 
with x;(£< 4 »(4) and define, for any i = 1 ..., m ?iP , 

z;£ (t)) = e x ;S (t)) w- 

^fcw.J n 4« )) 

Observe that by Item (ii) above, Z*^ 4 ^ = r> Z^ 4 ^ (where we recall that Z^ 4 ^ is defined by 

gO])) and by Item (i), the random matrices Z* 2 ’^ 4 ^, i = 1,..., m giP , j = 1,..., 2*4) 5 are mutually 
independent. The aim now is to prove that the following inequality is valid: 


2 fe W 

ETr exp ^24 XX« ,) )s(l + K,-»e’‘ M 
4=1 *=i 


2 fe(t) 


ETr exp (24 ^ 

4=1 *=1 


• (44) 


Obviously, this can be done by induction if we can show that, for any 4 in {1,... , 2*4)}, 


4-1 2 fc W rn qi p 

ETYexp ( 2f x: X 44 +4E 44) 

J = 1 2—1 j=£ 2=1 


< (l + (■ m qiP - l)e qtM 


ETr 


i m q p 

EE 

4=1 i=l 


ex P (2 t EE4 (,|) +^ E E44) 


2 fc (*) m q:P 

E E 

4=4+1 i=l 


(45) 
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To prove the inequality above, we set 


m qtP 


2 fc(t) , 


«<» 


Q- w w -v'EY, Z 1 “ + 2t E E %,2i 

J = 1 2=1 J=£ 2=1 

and we write 

27 ^q,p 

ETrexp (Q_i^(i)) = E^ l u (Mt ))=u «(Mt))Tr exp (Q_i^(t))) 

. u<f<*^ exp ( C *-M 


1=2 
+ E(l_ 


< ETrexp (Q, m (t)) + ®(l 3ie{2j (Q-v(*))) • (46) 

Note that the sigma algebra generated by the random vectors (U*^)iG{i,...,m q p }j'e{i,...,^-i} an d 

( U j§i)iG{i,...,m 9 , p },ie{£+i,..., 2 Mt)} is independent of ^((U^, . This fact together 

with the Golden-Thomson inequality give 

e( 1 


( 1 3ie{:2,..., m9iP }:UfWVu; ( 2 f )),IVeXP ( C <-M 


< 


!,2t 

i—l m qtP 


2 k (t) 


Tr(E( e xp(2(^^Z*«'»+2t £ £ 

jf=l i=l j=£+l i=l 


^(k(t)) 

j,2i 


x E( , . TT (fc(t))^ TT . ( fc(t)) exp (2t ^ 


3ie{2,...,m s , p }:U^VU*^ 

By Weyl’s inequality and ()21l) . we infer that, almost surely, 

277-0, v 

7 (fc(t)) 


(fc(t)) 

2i 


2=1 


A max (2t ^ ^2 A max (X fc ) < 2tm q . p pM < tqM . 


(47) 


2=1 


i_1 fcGJf fcC t )>i nJ^ t)) 


Therefore, applying (|25j) with 6 = tqM , T = {3z € {2, } and 

V = 2 1 YiT=i ^*j%^ + 2t Y?j=e +1 XS P and taking into account that 

277 q,p 

P(T) < £ P(u£f + U* ( 2 *f ») < (m 5>p - 1)^+1 , 


i=2 


we get 

e ( 1 


rWOl^u^WiTrexp (Q-i/ 


V 3ie{2,...,m 9lP }:U^VU, iai 

£-1 ra, J, p 2 fc W m 9 , P 

< (m 9iP - l)/3 p+ ie 9tM ETrexp ( 2! EE z S <,)>+ 2t E E Z S^ 

J = 1 2=1 j=i-\-l 2=1 




Using that the sigma algebra generated by the random vectors (U-y)^^ ...,m qp },je{i,-,£-1} 


and (Uj >2 J i )ie {1 ,...,m 8ip} je { /+ 1 ,...,2fc(*) } is independent of ^((Uj 2 /) ie{ i,..., m ^ } ), and noticing that 
by construction, E(Z^^) = E(Z^*^) = 0, an application of inequality (071) then gives 


E 1 


3ie{2,...,m 9 , p }:U^Vu* 


(»=(*)) -rjUMUiTA'exp (Q_i/ 


< P p +i(m qtP - l)e ?tM ETr exp (C^+i(f)) . (48) 
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Starting from (jj6j) and taking into account (l48lh inequality (T45l) follows and so does inequality 

AMD. 

With the same arguments as above and with obvious notations, we infer that 

2 fc (*) m q , p+ 1 

ETVe X p(2f]T -£ Z«*») 

1 = 1 *=1 

2 fc(t) m ?,P+l 

< (l + m q , p e 2(?tM /3 p+1 ) ETrexp(2t^ E Z*?/-!?) . (49) 

1=1 »=1 

Note that to get the above inequality, we used instead of (j47l) that, almost surely, 

1 ^9jP+1 

A max (2t Z 2 m-i) < 2 * E E A max(X fc ) 

* =1 * =1 k ^ K Ht),£ 

< 2 Mt(m qyP p + q — 2pm cpp ) = 2Mt(q — pm q>p ) < Mt(q + 2 p) < 2tqM . 
Starting from (1381) and taking into account (1TO , (EHl) and (H9l) , we then derive 


ETr exp (t e*. s n 1 + Pd k +ie 


ieK A 


ofe(t) fc (*)— 1 

n 

k =0 


fMn e 2 e 


2 k ( t ) rn q , p 


Vrexp(2t^gz* ( 2 f )) ) + lETVexp(2t^ E Z S-i)) • ( 50 ) 


2 fc W m 9 , P +l 


(*(*)) 


1=1 i=1 


1= 1 *=1 


Now we choose 


P = 


A 


tMJ L 2J 


Note that the random vectors (Z ^ 2 i-i)hj are mutually independent and centered. Moreover, 

2A max (Z*g ( ! ) 1 ) ) < 2Mp < j a.s. 

Therefore by using Lemma[3]together with the definition of u 2 and the fact that 2 k W (m qq ,+l)p < 

2 fc Wg < we g e t 


2 fe (‘) m,, p +l 

ETr exp ^2f ^j 2 i-i) — ^ ex P(4 x 3.1 x At 

1=1 i=l 


V) 


(51) 


Similarly, we obtain that 


2 fc (*): 


ETr exp ^2t Z* 2 )' j < dexp(4 x 3.1 x At 2 v 2 ). 

1=1 i=l 


(52) 


Next, by using Condition (j5j) and (fT9lh we get 

2 fc(t) 


log (l + m 9iP e 2f9M ^ + i)' ' 7 < 2 k ^m q}P e 2tqM e- cp < ^-e 2tqM e- cp . 


(53) 
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Several situations can occur. Either ( tM ) 2 < k/A and in this case k(t) = 0 implying that 
A/2 < q < A < n/(tM) 2 . If in addition q > 4 /(tM) then p = [2 /(tM)] > 1 /tM (since tM < 1) 
and 

2tqM Q -cp < AtM 2n/(tM) -c/(tM) < 3c /( 4tM ) < (tM) 3 e /(i 6t jyf) 

2p “2 “2 - c 

where we have used that log 2 A < 32 / M , > 2, and e ~ 3c /( 8tM ) < ££M f or the last inequality. If 

otherwise q < 4 /(tM) then p = [q/2] > q/ 4. Hence, since 2tM < c/16 (since logH > log 2) and 
tM > A/A, 

A c 2 tqM c ~cp < ^_ e -3cg/16 < ^ g -3cA/32 < ^j^ e ~3c/{8tM) < (^0_ e -3c/(32tM) 

2p ~ q ~ ~ ~ c 


where we have used that A/2 < q for the second inequality, and that log 2 A < 32 / M , A > 2 and 
-3c/(i6tAf) < mm f th last one 

— oc 

Either (tM) 2 > k/A and in this case k(t) > 1 and by using (1191) and the definition of k(t), 
we have 

q - 2 Ht)+1 - A(tM) 2 ' ^ 

If in addition q > 4 /(tM) then p = [2/(fM)] > 1 /tM, and by (fTBl) and the definition of k(t), 


q<2A 


(1 ~ S) 1 

2 fc h) 


< 


2 K 

(tM) 2 ' 


It follows that 

A p2t q M n -cp ^ AtM n 4 K /( tM ) n _ c /( tM ) ^ AtM n _ c /( 2tM ) ^ (tM) 2 n _ c /(8tM) 

C C ' _ v_/ C \ _ C ^ v 

2 p - 2 - 2 - c 


where we have used that log 2 A < - i2tM , H > 2 and e ~ c /( 4tM ) < AtM. f or the last inequality. Now 
if q < 4 /(tM) then p = [g/2] > g/4. Hence, using again the fact that 2 tM < c/16 combined 
with ([MD , we get 


—q 2 tqM e -cp < 8 A(t.M) 2 ^ 3cq/ie < 8 2 A(tM) 2 
2 p ~ k ~ c 


g 16x4x8 (tM) 2 < - g 

c 


8 (tM) 2 3c /( 3 2tM) 


where we have used that log 2 A < ^ 2 tM) 2 an< ^ 4l > 2 for the last inequality. 
So, overall, starting from ([53]) . we get 


log (l + m q , p e 2 ^ M p p+i y k(t) < ^l!g-3c/(32 tM) 


(55) 


( \ 2 k 

1 + Pd k +ie tMnt2,1 k J only in the case where (k/A) 1 / 2 < tM , 

otherwise this term is equal to one. By taking into account ©, (ED, (USD and the fact that 
tM < c5/ 8, we have 


k{t)—l 


!°g n ( i+ 


t+i e 


tMnfir 


k =0 


2k k(t) — 1 

< Y T ex p i - c 

k =0 
k(t )—1 

k ( 

— c 


A6(l-5) k „A(l-6) e 

y ’ + 2 tM v ’ 


< y 2fcex p 

k =0 


< 2 fc W 


exp 


2 k +i 

A5(l-5) k 
2 k + 2 

Ac5( 1 - syw- 1 
2 fc (*)+i 


2 fc 
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By the definition of k(t), we have A- - , -> -———. Therefore 2 k ^ < 2A . Moreover 

J w ’ 2 fc W~ 1 ( tM) 2 ~ K 

(1 -S)m-1 CK S 2 k 

2 k(t)+l * 4( iM )2 / tM ’ 

since tM < cS/8. It follows that 

log n (i + p dk+1 e tMn ‘ 2 *~ k y < 2A^^- exp ( - 2k/ (tM)) < ? ( 56) 

k =0 

where we have used the fact that log 2 A < 32 ^ M . So, overall, starting from (1501) and considering 
the upper bounds (I5TD . (JS2J, (1551) and (1551) . we get 

logETr exp (t ^ Xj) < logd + 4 x 3.lAtV + 9 ( tM ) 2 e ~3c/(32 tM) 
i&K A 

Therefore Inequality (1151) also holds in the case where 4 /A < tM < min (i, 321^^4 ) • This ends 
the proof of the proposition. □ 


4.3 Proof of Theorem |Tj 

Let Aq = A = n and ¥ (°\k) = X& for any k = 1,..., Aq. Let K Ao be the discrete Cantor type 
set as defined from {1,... ,A} in Section r4.2.11 Let A\ = Aq — Card(A'^ 0 ) and define for any 
k = l,...,A 1 , 

Y (1) (L) = X 4 where {h, • • • ,L4i} = {1,..., A} \ I< A . 

Now for i > 1, let I\ Ai be defined from {1, ..., Aj} exactly as K A is defined from {1, ... ,A}. 
Set A i+ 1 = Ai - Card (K Ai ) and {ji,... ,j Ai+1 } = {1, • • •, A} \ K Ai . Define now 

Y^ i+1 \k)=Y^(j k )iovk = l,...,A i+l , 


and set 

L = L n = inf {j € N* , Aj < 2} . 

Note that, for any i € {0,... , L — 1}, Ai > 2 and Card(iL J 4 i ) ^ Ai/2. Moreover Ai < n2~ l . 
The following decomposition clearly holds 


L—l 


E X * = E E ^ {i \k) + ^ {L \k) 


k=l 


i =0 keK A . 


k =1 


(57) 


Let 


Al 

Ui = Y(i \k) for 0 < i < L - 1 and U L = ^ Y (L) (A;), 

k£K A . k =1 


For any positive x, let 


h(c, x) = min 


/I clog2 \ 
v2’ 32log x) 


For any i € {0 ,... ,L — 1}, noticing that the self-adjoint random matrices (Y^\k))k satisfy the 
condition ([5]) with the same constant c, we can apply Proposition [7] and get that for any positive 
t satisfying tM < h(c,n/ 2*), 


logIETr(exp(fUj)) <logd + 


At 2 n2~ i (2v + y/3 x 2 5i / 2 M/(n Y2 y/c)) 2 
1 — tM/h(c , n2~ l ) 


(58) 
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On the other hand, by Weyl’s inequality, 


Amax(Ui) < MAl < 2M . 

Therefore by using Lemma [4] for any positive t , 

ETr(exp(fU L )) < dexp ( t 2 g(2tM)X max (E(i ] 2 L )) . 
Hence by the definition of v 2 , for any positive t such that tM < 1, we get 


log ETr (exp(tlU)) < log d + 2 t 2 v 2 < log d + y 


2 t 2 v 2 


-tM 


Let 


and 


Since 


we get 


M 


Hi = 


h(c, n/2 l ) 


for 0 < i < L — 1 and kl = M 


yn ( r- 2 l M \ r , r- 

(Ji = 2-^-— ( 2v + v3 x - ■= I for 0 < i < L — 1 and cr j, = vy/2 . 


2*/2 


L < 


ny/c. 

log n — log 2 
log 2 


+ 1 j 


L—l 


y Ki <M(y Mr-, + l) < ( 2 , 32 , l0gn ) = M 7 (c, n 

^ V h(c, n 2 l ) J- log 2 V’ clog 2 / ’ 


i=0 


^ h(c, n/2 i ) 


log: 


clog 2 


Moreover 


L—l 


2—0 2=0 


rx?l 2 y[c> 

< 1 4\/nv + 2c~ 1 / 2 n~ 2 M2 2L + v\[2 

< 15 \[nv + 2c~ 1 ^ 2 M . 


( 59 ) 


Taking into account (I58p and (1591) . we get overall by Lemma 0 that for any positive t such that 
tM < I/7(c, n), 


log ETr ( exp 


, ^ 2 2 n(l5u + 2M/(cn) 1 / 2 V 

(*!>)) < k, g ri+ i 1 _ tM7( / c )„; 


:=7 n(t). 


(60) 


To end the proof of the theorem, it suffices to notice that for any positive x 


P(A max (X>) > x) 

2=1 


< inf exp ( — tx + 'y n (t)) , 

t>0 : tM<l/j(c,n) 


where 7 n (2) is defined in (16(1 . □ 
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